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GENERIC-CASE COMPLEXITY, DECISION PROBLEMS IN 
GROUP THEORY AND RANDOM WALKS 

ILYA KAPOVICH, ALEXEI MYASNIKOV, PAUL SCHUPP, AND VLADIMIR 

SHPILRAIN 

Abstract. We give a precise definition of "generic-case complexity" 
and show that for a very large class of finitely generated groups the 
classical decision problems of group theory - the word, conjugacy and 
membership problems - all have linear-time generic-case complexity. We 
prove such theorems by using the theory of random walks on regular 
graphs. 
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1. Motivation 

Algorithmic problems such as the word, conjugacy and membership prob- 
lems have played an important role in group theory since the work of 
M. Dehn in the early 1900's. These problems are "decision problems" which 
ask for a "yes-or-no" answer to a specific question. For example, the word 
problem for a finitely presented group G = (xi, . . . ,Xk\r\, . . . , r m ) asks, 
given a word w in {x\, . . . , Xk}^ 1 , whether or not this word represents the 
identity element of G. The classical result of P. S. Novikov and of W. Boone 
states that there exists a finitely presented group with unsolvable word prob- 
lem. This implies that most other problems (the conjugacy, membership, 
isomorphism, and order problems) are also unsolvable in the class of all 
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finitely presented groups (see the survey papers |, H| for a detailed expo- 
sition). 

With the advance of modern computers interest in algorithmic mathemat- 
ics shifted to the realm of decidable problems with a particular emphasis on 
complexity of algorithms, and in the 1970s modern complexity theory was 
born. It quickly turned out that some decidable problems which one would 
really like to solve are too difficult to be solved in full generality on ac- 
tual computers. Among different possible complexity measures the most 
important for us here is time complexity. Usually, algorithms with linear, or 
quadratic, or sometimes even with high degree polynomial time complexity, 
are viewed as fast algorithms. Fortunately, several classes of infinite groups 
have fast algorithms for their decision problems. For example, the word and 
conjugacy problems for any word-hyperbolic group are solvable in linear and 
in quadratic time respectively, and the word problem for a linear group over 
the field of rational numbers can be solved in cubic time. On the other hand, 
there are finitely presented groups whose word problem has arbitrarily high 
time complexity. For a group with exponential time complexity of the word 
problem any algorithm solving the word problem needs at least exponen- 
tially many steps (in length of the word) to halt on infinitely many inputs. 
This type of analysis concerns the worst-case behavior of an algorithm and 
is now often called worst-case complexity. 

Many algorithms for solving the word problem in finitely presented groups 
are difficult to analyze and their worst-case complexity is not known. For 
example, for the Magnus algorithm for the word problem for one-relator 
groups |4j| we do not even know if the complexity is bounded above by 
any fixed tower of exponentials. Yet anyone who has conducted computer 
experiments with finitely presented groups knows that there is often some 
kind of an easy "fast check" algorithm which quickly produces a solution for 
"most" inputs of the problem. This is true even if the worst-case complexity 
of the particular problem is very high or the problem is unsolvable. Thus 
many group-theoretic decision problems have a very large set of inputs where 
the (usually negative) answer can be obtained easily and quickly. Indeed, 
our intuition on the subject has been formed by computer experiments and 
the main purpose of this paper is to explain some of this phenomenon. It 
turns out that a precise mathematical explanation comes from the theory of 
random walks on regular graphs. 

The kind of situation which we have in mind is often analogous to the 
use of Dantzig's Simplex Algorithm for linear programming problems. This 
algorithm is used hundreds of times daily and in practice almost always 
works quickly. The examples of V. Klee and G. Minty 41] showing that one 
can make the simplex algorithm take exponential time are very special. A 
"generic" or "random" linear programming problem is not "special" , and the 
algorithm works quickly. Observations of this type led to the development 
of average-case complexity. There are several different approaches to the 
average-case complexity, but they all involve computing the expected value 
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of the running time of an algorithm with respect to some measure on the 
set of inputs (for example, see [37, |42|), 

To study generic-case complexity, which deals with the performance of an 
algorithm on "most" inputs, we first need a notion of which sets are generic. 
Let v be a probability distribution on X* , or, more generally, an arbitrary 
additive function with values in [0, 1] defined on some subsets of the set X* 
of all finite words over a finite alphabet X. A subset T C X* is called 
generic with respect to v if u{X* — T) = 0. Then, for example, we would 
say that an algorithm f2 has polynomial-time generic-case complexity with 
respect to v, if f2 runs in polynomial time on all inputs from some subset T of 
X* which is generic with respect to v. Of course, we can define generic-case 
complexity being in any complexity class C, not only for polynomial time. 

Thus "generic-case" complexity is in the spirit of but quite different from 
average-case complexity in several respects. First of all, in average-case 
complexity the decision problem considered must be decidable and one has 
to have a total algorithm to solve it. One is then interested in the expected 
value of the running time of the algorithm. On the other hand, in generic- 
case complexity we consider the behavior of the algorithm only on a generic 
set T and completely ignore its behavior elsewhere. Thus we consider par- 
tial algorithms which may only halt on the set T and the total problem 
being considered can have arbitrarily high worst-case complexity or even be 
undecidable. 

The general idea of generic behavior in the context of group theory was 
introduced by M. Gromov [34] when he defined the class of word-hyperbolic 
groups. Gromov indicated that "most" finitely presented groups are word- 
hyperbolic. This was made precise by A. Ol'shanskii |5(J and also by 
C. Champetier |i~9| ] who formalized the notion of a "generic" group-theoretic 
property. Further research on generic group-theoretic properties has been 
done by C. Champetier 0, |2l|, G. Arzhantseva [|, §, |, §], A. Zuk 



(unpublished), P.-A. Cherix with co-authors [22, 23] and others. Recently 
M. Gromov [|36| pushed his ideas about "random groups" further with the 
goal of constructing finitely presentable groups that do not admit uniform 
embeddings into a Hilbert space. 

The notion of genericity in the work cited above concerns the collection of 
all finitely presented groups. In this paper we shift the focus to considering 
generic properties of algorithmic problems in individual groups with respect 
to asymptotic density (see Section ||) . 

We are grateful to Laurent Bartholdi and Tatiana Smirnova-Nagnibeda 
for many helpful discussions regarding random walks on groups and graphs, 
to Carl Jockusch and Frank Stephan for discussions on the general idea of 
generic-case complexity and to Bogdan Peternko for suggesting the use of 
Stolz' theorem. 
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Convention 2.1. We follow the book Computational Complexity of C. Pa- 



padimtriou [51] for our conventions on computational complexity. Recall 
that a complexity class is determined by specifying a model of computation 
(which for us is always a multi-tape Turing machine), a mode of compu- 
tation (e.g. deterministic or non-deterministic), resources to be controlled 
(e.g. time and space) and bounds for each controlled resource, that is func- 
tions f(x) such that for each input word w at most /(|w|) units of the 
specified resource needs to be extended by an appropriate Turing machine 
to reach a decision. 

In this paper, unless specified otherwise, when talking about a "complex- 
ity class C" , we assume that the resources to be controlled in the definition of 
C are either time or space or a combination thereof. We also assume that the 
collection of functions bounding each resource consists of proper complexity 



functions f(n) > (see [51] and Section Sectiomlang below) and that for 
any function f(n) in this collection and for any integer C > 1 the function 
C f{Cn + C) + C also belongs to this collection. Most reasonable complexity 
classes, such as linear time, polynomial space, log-space, etc, satisfy these 
restrictions. 

Recall that a decision problem is a subset T> of the set 

(X*) k = X* x ••• x X* 

{k > 1 factors), where X* is the set of all words on a finite alphabet X. (By 
introducing an extra alphabet symbol "," we could view a fc-tuple of words 
(wi,w<2, . . . , Wk) G (X*) k as a single word in the alphabet X U {, }.) 

In this section we focus on three classical decision problems for a given 
finitely generated group G: the word problem (WP), the conjugacy problem 
(CP), and the subgroup membership problem (MP). (Our approach is quite 
general and can be applied to other group-theoretic decision problems, such 
as the order of an element problem.) To formulate these problems precisely 
one needs to specify exactly how the group G is "given". To do this, one 
chooses a finite set of generators A of a group G, that is, one fixes a map 
7r : A — > G such that G = (ir(A)}. To simplify notation we identify elements 
of A with their images under ir in G. Put X = A U A^ 1 . Thus every word 
w G X* represents an element n(w) G G. 

Now we are ready to formulate the algorithmic problems above with re- 
spect to the given set of generators A: 

(WP) Given a word w G X* determine whether or not w represents the 
identity element in G (symbolically, w =g 1)- Thus 

WP{G, A) := {w G X* | w = G 1}. 

(CP) Given two words u, v G X* determine whether they represent conjugate 
elements of G or not. Thus 

CP(G,A) := {(u,v) G X* x X* \ n(u), ir(v) are conjugate in G}. 
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(MP) Let H < G be a fixed finitely generated subgroup. Given a word 
u G X* determine whether or not u belongs to H. Thus 

MP(G, H, A) := {w G X* \ ir(w) G 

Convention 2.2. We call these problems the A-versions of the correspond- 
ing problem about G to emphasize the choice of generators A. We use the 
notation D to denote a problem about a group G and we denote by T>a the 
^4-version of D corresponding to the finite generating set A of G. Thus if D 
is the word problem for G, then T>a = WP(G, A). If D is a problem about 
a group G and C is a complexity class, we say that D is solvable for G with 
complexity in C if for every finite generating set A of G the language T>a is 
in C. 

Of course, instead of the problems over X* one can consider decision prob- 
lems only over freely reduced words, that is, decision problems V C F(A) k , 
where F(A) is the free group on A. Since one can easily (in linear time) 
reduce a word in X* to its reduced form in F{A) these two decision prob- 
lems are equivalent with respect to time complexity classes. In average-case 
or generic-case complexity, where the measure on the set of inputs matters, 
the equivalence between these two points of view needs to be verified. Most 
of our results are unchanged if we take F{A) rather than X* as the set of 
inputs. 

If Y is another finite set of generators for G and Vy is the Y-version 
of the decision problem T> then these two decision problem are equivalent 
from the point of view of worst-case complexity. Indeed, every generator 
x G X = A U A^ 1 can be written as a word in F(Y). Thus every word in 
X* can be re-written in linear time as a word in Y* representing the same 
group element. This provides a linear-time reduction of T>a to T>y , and vice 
versa. Thus the worst-case complexity of group-theoretic decision problems 
does not depend on the choice of a finite generating set and is a true group 
invariant. By contrast, in the average or generic-case complexities a change 
in generating sets might conceivably give a different result and we will make 
invariance part of our definition. All of the results proved in this paper are 
invariant under change of a generating set. 

A more complicated class of algorithmic problems can be described as 
witness problems. Unlike decision problems, a "witness problem" asks to 
produce, for a given element u G V, an explicit justification or "proof" of 
the fact that u is, indeed, in T>. For example, the "witness" version of the 
Word Problem for a presentation (A\ R), given a word u G ncl(R), asks for 
an explicit expression of u as a product of conjugates of elements from i? ±1 . 

t 

i=i 

where Uj G ^(^4), rj G R, and ej = ±1. 
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The witness Conjugacy Problem would require producing a conjugating 
element, and the witness Membership Problem would ask for an expres- 
sion of a given u G (v\, ... ,Vk) as a product of the given generators (and 
their inverses) of the subgroup considered. Although witness problems are 
increasingly important (for example, in group-based cryptography ||]), we 
concentrate here on the traditional decision problems. 

Suppose we have a total algorithm £l\ solving a decision problem T> and 
also a partial algorithm 0,2 solving the problem generically with low generic- 
case complexity. Then by running Q,\ and VL2 m parallel we obtain a new 
total algorithm f2 = jf^ which solves T> with low generic-case complexity. 
The idea of putting these two algorithms together is in fact used by many 
practical experimenters. That is, for a particular problem one should look 
both for an exact solution with minimal known worst-case complexity and 
for a partial "generic" solution which will work very fast on most inputs. The 
computational group theory package "Magnus" already uses this philosophy 
very substantially, as most problems there are attacked by several algorithms 
running in parallel, including "fast checks" working with abelianizations and 
other quotients. We refer the reader to the article of G. Baumslag and 
C. F. Miller |l3| for a more detailed discussion on "Magnus". More recently, 
several applications of genetic algorithms in group theory jl^, [37]] revealed 
that some classical problems that were believed to have only "too slow", 
i.e., non-practical, solutions, admit a very fast solution generically. This, as 
well as numerous computer experiments, provided an important source of 
intuition for the present paper. 

If the generic-case complexity of f?2 is very low and the worst-case com- 
plexity of the total algorithm f^i is not too high, then the combined al- 
gorithm may have low actual average-case complexity. The idea of using 
generic-case results to prove average-case results in this way seems very 
fruitful, and we have already been able to obtain some interesting results 
which will be the subject of a future paper. 

3. Generic-case complexity 

We have stressed that in order to measure the "largeness" of a set of words 
on an alphabet one needs a measure or, at least, an additive positive real- 
value function defined on some sets of words in the alphabet. For this paper 
we use the asymptotic density function suggested in the work of A. Borovik, 
A. Myasnikov and V. Shpilrain |i~7|l (see also [|l6| ) and similar in spirit to 
concepts considered by M. Gromov, A. Ol'shanskii and C. Champetier. 

Definition 3.1 (Asymptotic density). Let X be a finite alphabet with at 
least two elements and let (X*) k denote the set of all k-tuples of words on 
X. The length of a /c-tuple (w\, ...,Wk) is the sum of the lengths of the Wi. 
Let S be a subset of (X*) k . For every n > let B n be the set of all A;-tuples 
in (X*) k of length at most n. 

We define the asymptotic density p(S) for S in (X*) k as 



GENERIC-CASE COMPLEXITY 



7 



p(S) := limsup p n (S) 

n— *oo 

where 

\snB n \ 

pn{S) := 



\B n \ 



If the actual limit linin^oo p n {S) exists, we denote p{S) := p{S). 
In the case where the limit 

lim p n (S) = p(S) 



exists we shall be interested in estimating the speed of convergence of the 
sequence {p n (S)}. To this end, if a n > and linin^oo a n = 0, we will say- 
that the convergence is exponentially fast if there is < a < 1 and C > 
such that for every n > 1 we have a n < Ca n . Similarly, if linin^oo b n = 1 
(where < b n < 1), we will say that the convergence is exponentially fast if 
1 — b n converges to exponentially fast. 

Definition 3.2 (Generic sets). We say that a subset S C (X*) k is generic 
iip(S) = l. 

If in addition p n (S) converges to 1 exponentially fast, we say that S is 
strongly generic. 

What we have really defined is being generic with respect to p in the sense 
discussed in Section 1. Since we now fix this particular concept of being 
generic, we simply say "generic" for the rest of this paper. The complement 
of a generic set is termed a negligible set. We can define strongly negligible 
sets in a similar manner. In the following lemma we collect several simple 
but useful properties of generic and negligible sets. 

Lemma 3.3. Let S,T be subsets of (X*) k . Then the following hold: 

1) S is generic if and only if S is negligible. 

2) // S is generic and S C T then T is generic. 

3) Finite unions and intersections of generic (negligible) sets are generic 
(negligible). 

4) // S is generic and T is negligible, then S — T is generic. 

5) The collection B of all generic and all negligible sets forms an algebra 
of subsets of (X*) k . 

Now we can define generic-case complexity of algorithms. 

Definition 3.4 (Generic and strong generic performance of a partial algorithm). 
Let V C (X*) k be a decision problem and let C be a complexity class. Let ft 
be a correct partial algorithm for T>, that is, whenever Q reaches a definite 
decision on whether or not a tuple in (X*) k belongs to T>, that decision is 
correct. 
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We say that O solves V with generic- case complexity C if there is a generic 
subset S C (X* ) k such that for every tuple r 6 S the algorithm Q terminates 
on the input r within the complexity bound C. 

If in addition the set S is strongly generic, then we say that the partial 
algorithm solves the problem V with generic-case complexity strongly C. 

We again point out that we completely ignore the performance of O on 
tuples not in S and the definition thus applies to the case where T> has 
arbitrarily high worst-case complexity or is indeed undecidable. 

One can now define "generic" complexity classes of decision problems in 
the obvious way. 

Definition 3.5 (Generic complexity classes). Let C be a complexity class. 
Then Gen(C) denotes the class of all decision problems V for which there 
exists a partial algorithm solving T> with generic-case complexity C. Simi- 
larly, SGen(C) denotes the class of all decision problems T> for which there 
exists a partial algorithm solving V with generic-case complexity strongly 
in C. 

As we mentioned before, while the worst-case complexity of most group- 
theoretic decision problems does not depend on the choice of a finite gener- 
ating set for a group, it is not at all clear (and is probably false) that generic- 
case complexity per se is independent of the chosen set of generators. In 
order to have a true group-theoretic invariant, we need to incorporate such 
independence into the following definition. 

Definition 3.6 (Generic-case complexity of a decsiion problem D for a group 
Let G be a finitely generated group. Let D be an algorithmic problem about 
the group G. We say that the decision problem D for G has generic-case 
complexity in C (strongly in C if for every finite generating set A of G there 
exists a partial algorithm fl(A) which solves the problem T>a C (A U ^l -1 )* 
with generic-case complexity C (strongly in C). 

4. Main results 

In this section we formulate the main results of the paper. Even though 
our results regarding the word problem follow from the more general theorem 
about the membership problem (see Theorem B below), we state the word 
problem results first since most of the applications which we have in mind 
concern the word problem. 

The concept of a group being non-amenable plays an important role in 
our results but for now the reader needs only to remember that any group 
which contains a free subgroup of rank two is non-amenable. 

Theorem A. Let G be a finitely generated group. Suppose that G has a 
finite index subgroup that possesses an infinite quotient group G for which the 
word problem is solvable in the complexity class C. Then the word problem 
for G has generic-case complexity in the class C. Moreover, if the group G 
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is nonamenable, then the generic- case complexity of the word problem for G 
is strongly in C. 

There are a number of interesting immediate corollaries of the above re- 
sult. 

Corollary 4.1. Let G be a finitely generated group. 

1. Suppose G has a finite index subgroup that possesses an infinite word- 
hyperbolic quotient G. Then the word problem for G is generically in 
real time. Moreover, if G is non-elementary, then the word problem 
for G is strongly generically in linear time. 

2. Suppose G has a finite index subgroup that possesses an infinite au- 
tomatic quotient G. Then the word problem for G is generically in 
quadratic time. Moreover, if G is non-amenable, then the word prob- 
lem for G is strongly generically in quadratic time. 

3. Suppose G has a finite index subgroup that possesses an infinite quo- 
tient group G, that is linear over a field of zero characteristic. 

Then the word problem for G is generically in polynomial time. 
Moreover, if if G is not virtually solvable, then the word problem for 
G is strongly generically in polynomial time. 

Proof. It is well known that for any word-hyperbolic group and for any 
finite generating set of this group, there is a set of defining relators for 
which Dehn's algorithm solves the word problem in linear time in the length 
of the input word. Moreover, this linear-time algorithm can be carried out 
by a multi-tape Turing machine. This was first observed by Domanski and 
Anshel ||] (see also 0|for a detailed description of the algorithm). Moreover, 



Holt and Rees |38|, |39[ have proved that for a word-hyperbolic group the 
algorithm solving the word problem can be carried out by a multi-tape real- 
time Turing machine. 

It is also well known that any word-hyperbolic group is either virtually 
cyclic (in which case it is called elementary) or contains a free group of rank 
two (in which case it is called non- elementary). Thus every non-elementary 
word-hyperbolic group is non-amenable. Together with Theorem A this 
implies the first part of Corollary |4.1|. 

Similarly, the classical result of ]|27f| shows that for an automatic group 
with any finite generating set there is an algorithm which solves the word 
problem in quadratic time. Again, by Theorem A the second part of Corol- 



lary 4.1 immediately follows. 



An important result of R. Lipton and Y. Zalcstein [43] states that for a 
finitely generated group over a field of characteristic zero the word problem 
is solvable in log-space and hence in polynomial time. By a famous theorem 
of J. Tits [56], a finitely generated linear group that is not virtually solv- 



able contains a nonabelian free subgroup and hence is nonamenable. This, 



together with Theorem A, implies the third part of Corollary |4.l| . □ 
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Example 4.2. If G is any finitely generated group with infinite abelianiza- 
tion then G maps onto the infinite cyclic group and hence by Corollary |4,1| 
the word problem in G is solvable generically in linear time. This is also 
equivalent to being able to write G as an HNN extension in some way. The 
result thus applies to all knot groups, all Artin groups and infinite one-relator 
groups. 



Example 4.3. Let G be a finitely generated infinite virtually solvable group. 
Then G has a finite index subgroup that possesses an infinite virtually 
abelian quotient. Hence by Corollary LI the word problem in G is solv- 
able generically in linear time. 



Example 4.4. Recall that the n-strand braid group B n , where n > 3, is 
given by the presentation 

B n = (aij . . . ,On_i | ajOj+iaj = a i+ iaia i+ i, for i = 1, ...,n - 2, 
and CLidj = a^ai for \% — j\ > 1) 

The pure braid group P n corresponds to those n-strand braids where every 
strand ends in the same position that it begins. Then P n is a normal sub- 
group of index n! in B n and B n /P n is isomorphic to the symmetric group S n . 
While it is hard to map one braid group onto another, this task is easy with 
pure braid groups: for n > 4 the group P n maps onto P n -\ by "pulling out" 
the last strand of a braid. Thus for every n > 3 the group P3 is a quotient 
group of P n . It is well-known that P3 = F(a, b) x Z. Thus for each n > 3 
the group P n has a non-abelian free quotient F(a, b). Since P n is of finite in- 



dex in B n and since F(a, b) is nonelementary word-hyperbolic, Corollary 4.1 
implies that for n > 3 the groups P n and B n have word-problems solvable 
with generic-case complexity strongly in linear-time. 



Example 4.5. Let G = Aut(F n ) or G = Out(F n ) where n > 2. Then by 
looking at the action of an automorphism (an outer automorphism) of F n on 
the abelianization of F n we see that G maps onto the group GL(n, Z). Since 
the word problem in GL(n, Z) is solvable in quadratic time and GL(n, Z) is 
non-amenable (it contains a nonabelian free subgroup), Corollary |4. 1| implies 
hat the word problem for G is solvable strongly generically in quadratic time. 

This observation raises the interesting question of determining the worst- 
case complexity of the word problem for G = Aut(F n ) , say with generators 
the elementary Nielsen automorphisms. We usually think that the "obvious 
algorithm" of checking if the action of an automorphism a fixes the gener- 
ators is very simple, but writing out all the intermediate steps could yield 
exponentially long words. It is not clear if there an algorithm with better 
worst-case complexity. 



Example 4.6. Theorem A holds even if G has unsolvable word problem. 
We consider the finitely presented Boone group B with unsolvable word 
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problem as described in J. Rotman's book [52]. One proves the word problem 
unsolvable by showing that equality between certain "special" words exactly 
mimics the word problem in a semigroup with undecidable word problem. 
We again have the situation that the complexity hinges on words of a very 
special form. It is easy to see that the group B has the nonabelian free group 
generated by all the r, as the quotient group which is obtained by killing all 
the other generators. Thus the stronger conclusion of the theorem applies 
and the generic-case complexity of the word problem for B is strongly linear 
time. This is not really surprising and is a precise version of the statement 
that the group B is "large" and the set of special words is really quite 
"sparse" . 

Example 4.7. Let G be a group with a finite presentation involving at 
least two more generators than relators. By the result of B. Baumslag 
and S. J. Pride Jll]] G has a subgroup of finite index that can be mapped 
homomorphically onto the free group of rank two. Hence by Corollary |4.l| 
G has word problem solvable strongly generically in linear time. 

In strong contrast with worst-case complexity is the fact that generic- 
case complexity for a problem D for a group G tells us nothing whatsoever 
about the complexity of D for subgroups of G. For example, if G is any 
finitely generated group, then G is certainly embedded in the direct product 
P = G x F(a, b) of G and the free group F(a, b) of rank two. We can apply 
Theorem |A] to P by taking the homomorphism to F(a, b) which kills all the 
elements of G. Since F(a, b) is hyperbolic and non-amenable, Theorem [A] 
implies that the word problem in P is strongly generically in linear time. 
But this says nothing at all about G because we just erased all information 
about G. This remark does show that every finitely generated group can be 
embedded in a finitely generated group whose word problem has generic-case 
complexity strongly in linear time. A well-known theorem of B. H. Neumann 
(see p2j) shows that there are continuumly many 2-generator groups, and 
thus there are continuumly many n-generator groups for every n > 2. Thus 
there are continuumly many finitely generated groups whose word problem 
has generic-case complexity strongly linear time. This is in sharp contrast 
with the fact that there are only countably many finitely generated groups 
with solvable word problem. 

The following computer experiment is easy to program. Let F n be a free 
group of rank n and let <p be the homomorphism from F n to F n _k defined 
by sending the the first k < n generators of F n to the identity. Pick a large 
length I and use a random number generator to generate a large number 
of random freely reduced words of length I. If one calculates the ratio of 
the number of words w with <p(w) ^ 1 to the total of number of words 
generated, one observes exactly the phenomena predicted by the theory of 
random walks. 

We now turn to the membership problem. It is necessary to discuss 
both a basic situation where the membership problem is solvable and also 
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a basic result about undecidability of the membership problem. We first 
observe that if G is any finitely generated group and H is a subgroup of 
finite index, then the membership problem for H in G is decidable in linear 
time . Choose a finite set A of generators of G. The Schreier coset graph 
r(G, H, A) is defined as follows. The vertex set V of T(G, H, A) is the set of 
cosets {Hg\g G G}. If y G A then there is an edge labeled by y from Hg to 
Hgy. Every edge in T(G, H, A) with label a £ A is equipped with a formal 
inverse edge labeled by a -1 . Thus T(G,H,A) is an oriented labeled graph. 

If A is finite and H has finite index in G then the graph T(G, H, A) is 
a finite. We can view T(G,H,A) as the transition graph of a finite state 
automaton M where the initial state and the only final state is the coset 
HI = H. By the definition of the coset graph, for any word w on the 
generators and their inverses, M accepts w if and only if w G H. Thus the 
membership problem for H is indeed decidable in linear time: given a word 
w G (A U A^ 1 )* , read w on the graph starting at the coset H and see if one 
ends back at the coset H. A generalized version of these ideas is currently 
important in geometric group theory. 

Theorem B. Let G be a finitely generated group and let H < G be a finitely 
generated subgroup of infinite index. Let G\ be a subgroup of finite index in 
G such that H < G\ and let <f) : G\ — > G be an epimorphism. Assume that 
H = <p(H) is contained in a subgroup K of infinite index in G and such that 
the membership problem for K in G is in the complexity class C. Then the 
membership problem for H inG has generic-case complexity inC. Moreover, 
if the Schreier coset graph T(G, K, A) is non-amenable (for some and hence 
any finite generating set A of G), then the generic-case complexity of the 
membership problem for H in G is strongly in C. 

The "strong" conclusion of Theorem |B| holds, for example, if G is non- 
elementary hyperbolic group and K is a quasiconvex subgroup of G. Indeed, 
in this case the coset graph T(G, K, A) is non-amenable by a recent result of 
I. Kapovich |4(]]. Since the membership problem for a quasiconvex subgroup 
of a hyperbolic group is solvable in linear time, Theorem [B] implies that the 
membership problem for H in G is strongly generically in linear time. 

Example 4.8. An Artin group is a group with a presentation 

(1) G = {ax, . . . , a n \uij = Uji, where 1 < i < j < n, ) 

be an Artin group where for i ^ j 

Uij . — aiOjOi . . . 
rriij times 

The Coxeter group C associated with G is the quotient group obtained 
by setting the squares of the generators equal to the identity. In general, 
the membership problem may be unsolvable for a Coxeter group or an Artin 
group. A Coxeter group or an Artin group is of extra-large type if all > 4. 
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Any Coxeter group of extra-large type with at least three generators is a non- 
elementary hyperbolic group. Appel and Schupp|5| solved the membership 
problem for subgroups generated by subsets of the given generators in Artin 
groups of extra-large type, but very little is known about the membership 
problem for arbitrary finitely generated subgroups. If H is a finitely gener- 
ated subgroup of an Artin group G such that the image H in the Coxeter 
quotient C and is quasiconvex then the membership problem for H in G has 
generic-case complexity strongly linear time. Schupp |53| showed that all 
groups in a very extensive class of Coxeter groups are locally quasiconvex, 
that is, every finitely generated subgroup is quasiconvex. Also, one can check 
whether or not a finitely generated subgroup has infinite index in quadratic 
time. This provides a large set of examples of finitely generated subgroups of 
Artin groups where the generic-case complexity of the membership problem 
is strongly in linear time. 



Example 4.9. A basic negative result about the membership problem is 
the theorem of K. Mihailova [46| that if P n = F n x F n is the direct product 
of two copies of the free group F n of rank n > 2, then there are subgroups 
H of P n with unsolvable membership problem (see p3|]). Let 



be a finitely presented group with unsolvable word problem. By using the 
well-known Higman-Neumann-Neumann embedding of a finitely presented 
group into a 2-generator group, we may assume that n is any integer which 
is at least 2. We use the ordered pair notation for elements of the direct 
product P n = F n x F n . Let H be the subgroup of P n with generators 

(xi,xi), (x n ,x n ), (l,n), (l,r m ) 
Since the are defining relators for G, an easy argument shows that 



(u, v) G H if and only if u = v in G 

Thus deciding membership in H is equivalent to solving the word problem 
in G. 

We point out that "genericity" is operating at three different levels when 
considering the membership problem. Let us fix P n as the direct product of 
two free groups of rank n. Call a subgroup H a subgroup of Mihailova type if 
H has a set of generators of the form (*) above, which is very special. If we 
choose a random set of generators for a subgroup, it is very unlikely that they 
will be even close to being of Mihailova type. The remarks above showed that 
membership in a Mihailova subgroup H is equivalent to the word problem for 
the group G whose defining relators are the r^. So just among subgroups of 
Mihailova type, if we choose the at random we encounter the phenomenon 
that finitely presented groups on a fixed set of generators are generically 
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hyperbolic and thus the membership problem for the corresponding H is still 
actually solvable in linear time. But Theorem B still applies to a particular 
Mihailova subgroup chosen to have unsolvable membership problem. All 
the known explicitly constructed groups with unsolvable word problem have 
at least infinite cyclic quotients, even after embedding into a two- generator 
group. That is, there is a homomorphism <j> from F n to 7L which sends all 
the Ti to the identity. Let ip be the homomorphism from P n to Q n = F n xZ 
defined by ip(u,v) = (u,(p(v)). The image H of H is F n x {1} which has 
infinite index in Q n . The membership problem for H in Q n is clearly in 
linear time since to decide if (u, v) £ H one only has to check if v equals 
the identity. If, for example, we use the Boone group B directly, without 
reducing the number of generators, to construct a Mihailova subgroup, then 
we have a homomorphism where the image H is the first factor of x F2 
and the generic-case complexity of the membership problem for H is strongly 
linear time. 

There is a similar theorem for the conjugacy problem. 

Theorem C. Let G be a non-cyclic finitely generated group with infinite 
abelianization. Then the generic-case complexity of the conjugacy problem 
for G is linear time. 

Theorem C is applicable to a wide variety of groups, such as infinite one- 
relator groups, Braid and Artin groups, knot groups etc. 

We shall see that the proof of the theorem reduces to the case of the word 
problem since two words are conjugate in an abelian group if and only if they 
are equal. The reader has probably noticed that a statement about strong 
generic-case complexity in the case of non-amenable quotients is missing 
from the theorem. At the present writing we do not have a proof which is 
invariant under changing the set of generators although we believe that such 
a theorem is true. 

A very interesting class of finitely presented groups with unsolvable con- 
jugacy problem is the class of residually finite groups with unsolvable conju- 



gacy problem constructed by C. F. Miller [47]. Given any finitely presented 
group G with unsolvable word problem, Miller shows how to construct a 
group M{G) which is the semidirect product of two finitely generated free 
groups (and which is thus residually finite) where conjugacy in M{G) codes 
the word problem for G. As usual, the "code words" have a special form. 
The groups M(G) have large nonabelian free quotients. We can show (al- 
though the argument is not presented in this article) that the conjugacy 
problem of such an M(G) has generic-case complexity which is strongly lin- 
ear time because the free quotient is obtained by simply killing some of the 
given generators. 

We again stress some important limitations of generic case complexity. 

First, just the definition of generic-case complexity does not say anything 
about the speed with which a particular sequence tends to one or zero. If 
the quotient group G is infinite but not "large enough", say G = Z, this 
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speed may in fact be much slower than the exponentially fast convergence 
which we are really aiming at. The weaker convergence is all that we have 
for general one-relator groups. 

Second, there is a substantial difference between our notion of "generic 
performance" and the notion of "average case complexity". In a situation 
like the word problem for one-relator groups where, although its complexity 
is not known, we at least have a total algorithm which is well understood, a 
future hope is to combine generic and worst-case methods to obtain average- 
case results. In this regard the work of jR], [l?]] about constructing explicit 
measures on free groups may be particularly useful. 

In general, our approach simply shows that for the "decision" version of 
the word and the membership problem the fast " No" answer component 
of the set of all inputs is very large. One may be mainly interested in 
some infinite recursive subset of inputs and many examples may not admit 
algorithms with fast generic performance when restricted to the subset of 
interest. 

Finally, our results do not say anything about the generic behavior of the 
"witness" versions of the word, conjugacy and membership problems. Yet it 
appears to us that if one has in mind practical cryptographic applications, 
these applications have to be based on the "witness" versions of the problems 
(rather than "decision" ones). 

Thus we regard this paper as just the first step in the direction of un- 
derstanding the generic-case and average-case behavior of various group- 
theoretic algorithms. 

The results which we discuss in the last section of the paper (on finding 
languages which are not in given generic complexity classes) are due to Carl 
Jockusch and Frank Stephan. For example, the set of languages over a finite 
alphabet A (with at least two letters) which are generically computable 
has measure zero (in a precise sense) in the set of all languages over A. 
Moreover, given any proper time-complexity function f(n) one can construct 
a language that is deterministically computable in time / 3 (n) but which 
cannot be generically computed in time f(n). 

These general results do not, however, answer the question of construct- 
ing finitely generated groups with decision problems of arbitrarily "high" 
generic-case complexity, say with a word problem which is not generically 
solvable. All our results in this paper are proved by the "quotient method" of 
finding an infinite quotient group in which the relevant problems have the de- 
sired complexity. Using the existence of two disjoint recursively enumerable 
sets which are not recursively separable and the Adian-Rabin construction, 
C. F. Miller III |5| constructed an example of a finitely presented group 
G all of whose nontrivial quotients have unsolvable word problem! This 
particular group G therefore completely defeats our method of proof but it 
may well be the case that the word problem has low generic-case complexity 
for some different reason. Indeed, it seems to be a very difficult problem 
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to construct a finitely generated group where the generic-case complexity of 
the word problem is provably not linear. 

5. COGROWTH AND SIMPLE RANDOM WALKS ON REGULAR GRAPHS 

The proofs of our theorems depend on already known nontrivial facts 
about the behavior of simple random walks on regular graphs. The really 
hard work is done by that theory, so we now turn to the results which we 
need. 

The subject of random walks on graphs and groups is vast and very active. 



We refer the reader to |Tq, 33 , pB, 33 , p4] for some background information 



and further references in this area. We will recall several basic definitions in 
facts which are directly needed in our arguments. 

Definition 5.1. Let T be a ci-regular graph (where d > 2) with a base- 
vertex Xq. 

Then let a n (T) = a n denote the number of reduced paths (i.e. paths 
without backtracks) of length n from xq to xq in r. Similarly, let 6^(r) — b n 
be the number of all paths of length n from xq to xo in T. Also let r n = r n (T) 
be denote the number of reduced paths of length at most n from xo to xo in 
r. Finally, let 

z n — ^n(r) denote the number of all paths of length at most 
n from xo to xo in T. Thus r n = Y^i=o a i an d z n = J27=o 
Put 



a(T) = a := lim sup \fa^, 

n— >oo 

f3(T) = (3 := lim sup \fhn 

and 

u(T) = v := - d (3(T). 

We shall refer to ot(T) as the cogrowth rate of T and to f(r) as the spectral 
radius of T. The number (3(T) will be called the non-reduced cogrowth rate 
of r. 

It turns out that the definitions of a(T), /3(T) and z^(r) do not depend on 
the choice of a base-point xo 6 T and we have (see for example |54|, |l8||): 

Lemma 5.2. Let T be a connected d-regular graph with a base-vertex xq, 
where d>2. Then: 

1. The values of a(T), /3(T) and v(T) do not depend on the choice of a 
base-point xq £ T. 

2. < a(T) <d-l,0< (3(T) < d and < i/(T) < 1. 

3. v = limsup n ^ 00 where is the probability that a simple ran- 
dom walk on T originating at xq will return to xq in n steps. 
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Definition 5.3. Let T be a d-regular graph where d > 2. We will say that 
r is amenable if = 1. 

An important result connecting cogrowth and spectral radius was first 
obtained by R.Grigorchuck [ 33 1 and J.Cohen [24| for Cayley graphs of finitely 
generated groups and later generalized by L.Bartholdi to the case of 
arbitrary regular graphs. 



Theorem 5.4. Let T be a d-regular graph (where d > 2). 
(3 = 0(T) and v = v(T). 
Then 



Let a = a(T), 



I 2 ^d 1 otherwise. 
In particular v < 1 <^=^ a < d — 1 (3 < d, that is v = 1 <^=^ a = 

d-1 <=^> (3 = d. 

The above theorem indicates that V is amenable if and only if it has 
maximal possible cogrowth for a d-regular graph. 

The following classical result is known as Stolz' Theorem (see for exam- 
pie ||): 

Lemma 5.5. Suppose x n ,y n are sequences of real numbers such that y n < 
y n +\ for every n with linin^oo y n = oo and such that a finite limit 

t 2-n+l X n 

lim 

n^oo y n+l - y n 

exists. Then 

lim = nm . 

n^co y n+1 - y n n->oo y n 

Lemma 5.6. Let c n > and c > 1 be such that lim n ^ 00 % = 0. Put 

fn = E"=0 °n- Then lim n^oo j£ = 

Proof. Applying Stolz' Theorem to x n = f n and y n = c n immediately yields 
Lemma □ 

Our principal technical tool is: 

Theorem 5.7. Let V be an infinite connected d-regular graph, where d > 3. 
Let a n = a n (r) and r n = r n (T). Then 

(i) 

lim Q " = lim ^ = 0. 

n^oo yd — l) n n^oo d n 



lim — — n = lim — — = 0. 

n— »oo (d — l) n n— >oo d n 
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Proof. This fact is essentially due to L.Bartholdi as it follows from the re- 
mark on p. 99 in (Kj. It was first obtained (in a stronger form) by W.Woess 



[33 1 for the case where T is the Cayley graph of a finitely generated group. 
We present briefly a formal argument for completeness. 

Notice that by Lemma ^6| (i) implies (ii) since r n = Yli=o an an( ^ Zr > 
J27=obn- We will now verify (i). 



Suppose first that a(T) < d — 1 and hence @(T) < d by Theorem 5.4 . 
Then there is iVo > 1 and < a < d — 1 such that for all n > iVo we have 
a n < CL n - Hence for n > Nq 

n a n 

< 7^ ► 



(d - l) n - (d - l) n n-oo 

as required. A similar argument implies that lmiy^oo b n /d n = 0. Hence 
the statement of Theorem |5.7| obviously holds. Thus we may assume that 



a(r) = d — 1, so that (3(T) = d and v(T) = 1 by Theorem 5A. Then the 



word-by-word repetition of the proof of Lemma 3.9 of [10] implies that 



lim - — Qn , = lim = 0. 

(d - l) n n^oo d n 



Indeed, Lemma 3.9 of []10[ proves a stronger version of Theorem |5.7| under 
the assumption that V is also quasi-transitive. However, the only place in the 
proof of Lemma 3.9 in JlQ] , where quasi-transitivity is used, is to conclude 
that /3(r) = d which is already known in our case. □ 

In case where V is non- amenable, we can say even more. 

Proposition 5.8. Let V be a non-amenable connected d-regular graph where 
d > 3 (and hence V is infinite). Let a n = a n (T), r n = r n (T), b n = b n (T) 
and z n = z n (T). Then 

1. Both ( rf °i)n — ► and ^L^n ~^ exponentially fast. 

2. Both % — > and — > exponentially fast. 

Proof. Since T is non-amenable, we have a = lim sup ^fa^ < d — 1 which 
immediately implies that ^"^ n — > exponentially fast. It also means that 
there are uq > 1 and 1 < a < d — 1 such that for any n > no we have 
a n < aU - Hence for n > uq 



r n -l 



n-n Q 
no" 



Thus there are A,B>0 such that for any n > no we have r n < A + Ba n . 
Since 1 < a < d— 1, this implies that ^z^n converges to zero exponentially 



fast. Thus part 1 of Proposition |5.8| is verified. 

The non-amenability of T implies = lim sup \fb~^ < d, which implies 
part 2 of Proposition by the same argument as above. □ 
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6. COGROWTH IN GROUPS 

Let G be a group with a fixed finite generating set A consisting of k > 1 
elements. If w is a word in A U ^4 _1 , we will denote by tt(w) the element 
of G represented by w. We will also denote by | -w | the length of the word 
w. For an element g G G denote by \g\A the length of a shortest word in 
A U A^ 1 representing g. Also, if Q is an alphabet, we will denote by Q* the 
set of all words in Q. For a subset SCGwe will denote by Sa the set of 
all words in (^4 U A^ 1 )* representing elements of S. 

Let H < G be a fixed subgroup (not necessarily normal). Let T = 
T(G,H,A) be the Schreier coset graph defined in Section se:l-4. Then T 
is a connected 2/c-regular graph. Note also that if H is normal in G, then F 
is precisely the Cayley graph of the group G/H with respect to the gener- 
ating set A. Thus every edge-path in T has a label which is a word in the 
alphabet A U A -1 . It is easy to see that for any word w and any vertex x of 
T there exists a unique path in T with label w and origin x. Moreover, if w 
is the label of a path p starting at the vertex xo := HI in T, then w G H if 
and only if the terminal vertex of p is also equal to HI. The graph-theoretic 
concepts from the previous section can now be re-stated as follows: 

a n (G, H, A) = #{w\w is a freely reduced word of length n in A U A^ 1 
with ir(w) £ H}, 

b n (G, H, A) = #{w\w is a word of length n in A U A^ 1 with n(w ) G .ff}, 

r n (G, i?, A) = #{w\w is a freely reduced word of length < n in A U ^4~ x 
with 7r(u?) G .ff} 

and 

z n (G, H, A) = #{w\w is a word of length < n in A U ^4 _1 with 7r(u>) G 

Proposition 6.1. Let G be a group with a fixed finite generating set S and 
let T = r(G, H, A) be the coset graph with base-vertex xq = HI. Then: 

a n (G,H,A)=a n (T), b n (G, H, A) = b n (T), r n (G , H , A) = r n (T) and 
z n (G,H,A) = z n (T). 

Proof. This fact follows directly from the definition of F = T(G, H, A) and 
the fact that a word w over A U A^ 1 represents an element of H if and only 
if the path in T staring at HI and labeled w terminates at HI. □ 

For this reason a(G, H, A) := a(T) is called the co-growth rate of H in G 
with respect to A and (3(G, H,A) := (3(F) is called the non-reduced co-growth 
rate of H in G with respect to A. Similarly, v(G, H, A) := u(F) is called the 
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spectral radius of H in G with respect to A. As before, a(G, H, A) < 2k — 1, 
(3(G, H, A) < 2k. Moreover a(G, H,A) = 2k-l if and only if (3(G, H, A) = 
2k if and only if T is amenable. 

It is easy to see (and it is well-known) that amenability of T(G, H, A) 
does not depend on the choice of a finite generating set A for G: 

Proposition 6.2. Let G be a finitely generated group and H < G be a 
subgroup. Suppose A, B are two finite generating sets for G. Put V = 
T(G, H, A) and V = T(G,H,B). Then T is amenable if and only if V is 
amenable. 



Proof. By Proposition 38 and Theorem 51 of Kg] amenability is a quasi- 
isometry invariant for regular graphs of finite degree. Let us equip T and 
r' with simplicial metrics d and d' accordingly. Let C := max{|a|s | o, £ A} 
and C' := max{|6|yi | b £ B}. Then for any two cosets Hg\,Hg2 we have 
d'(H gi ,Hg 2 ) < C'd(Hg 1 ,Hg 2 ) and d(H gi ,Hg 2 ) < Cd' \H 9l , Hg 2 ). Thus 
the identity map Id : (VT,d) —* (VT',d') is a quasi-isometry, which implies 
the statement of the proposition. □ 

According to the traditional definition, a finitely generated group G is 
called amenable if any action of G on a compact space Y by homeomor- 
phisms admits a G-invariant probability measure on 7. It turns out that if 
A is finite generating set of G and H < G is normal, then T = T(G, H, A) 
is amenable if and only if the quotient group G± = G/H is amenable. In 
particular G itself is amenable if and only if its Cayley graph T(G,A) is 
amenable. 

Suppose now that G = F = F(x\, . . . , Xk) is a free group of rank k > 2. 
It is easy to see that the number of vertices of the n-sphere in the Cayley 
graph of F with respect to the free basis A = {x±, . . . , x^} is 2k(2k — l) n 
for n > 1. Hence the number of elements of F in the n-ball around the 
identity is 1 + j^[(2k - l) n_1 - 1] for n > 1. 

Theorem 6.3. Let F = F(x\, . . . , x^) and let H < F be a subgroup, where 
k > 2. Put A = {xi,...,x k }. Let a n = a n (F,H, A), r n = r n (F, H, A) 
and a = a(F, H, A). Similarly, let b n = b n (F, H, A), z n = z n (F, H, A) and 
(3 = P(F,H,A). 

Then T is a 2k-regular graph, and a < 2k — 1, f3 < 2k . Moreover: 
1. If [F : H] = oo then 



and 



lim . , Qn . = lim — - — n - = 0. 

rwoo (2k - l) n n^oo (2k - l) n 



hm - - = lim - - ; = 0. 

n^oo (2k) n n^oo (2k) n 



Thus Ha has zero asymptotic density in (A U A 1 )* since the number 
of vertices in the ball of radius n > 1 in the Cayley graph T(F, A) is 
l + ^^-lf- 1 -!]. 



GENERIC-CASE COMPLEXITY 



21 



2. If the coset graph for F relative H is non-amenable (and hence [F : 
H] = oo) then all the limits in part 1 converge to zero exponentially 
fast. 

3. If [F : H] < oo then 



lim sup 

h 

lim sup 



> 0, lim sup 



(2k)< 



> 0, lim sup 



(2k) 



(2k - l) n 
> 0. 



>0, 



Proof. Parts 1 and 2 of this statement follows immediately from Theorem 5.7 



and Proposition 5.8. We will now establish part 3 Theorem 6.3. Note that 
r n > a n and z n > b n . Thus it suffices to check that lim-sups involving a n 
and b n are positive. Since [F : H ] < oo, there is a normal subgroup of finite 
index Hi < F such that Hi < H < F. Then a n (F,H,A) > a n (F,Hi,A) 
and b n (F, H, A) > b n (F, Hi, A). So it suffices to consider the case where H is 
normal of finite index p in F. In this case the coset graph T = T(F, H, A) is 
finite and has p vertices. Thus T is amenable and a(T) = 2k — 1, (3(T) = 2k. 
Then by the results of W.Woess pi] and L.Bartholdi ilC 



lim sup 



(2k - 1) 



lim sup 



(2k)' 



Thus Theorem 6.3 is proved 



~ if T has some odd-length circuits 
- if r has only even-length circuits 

□ 



When H is a normal subgroup of F, the first part of Theorem |6.3| is originally 
due to W.Woess [p3||. One can obtain much more precise statements than 



Theorem 6.3, where the denominators are replaced by powers of the co- 



growth rate of H, but Theorem 6.3 is quite sufficient for our purposes. 



We refer the reader to M, 25, 34 



7. The Membership problem 

,31] for the background information 



on hyperbolic and automatic groups and their rational subgroups. We will 
recall several relevant definitions and results. 

Definition 7.1. Let G be a group with a finite generating set A. Let L 
be a language over A U A^ 1 such that n(L) = G, where it is the natural 
map from the free semigroup on A U A^ 1 to the group G. Let H < G be a 
subgroup. 

1. The subgroup H < G is said to be L-rational if the set 

L H := {w G L | ir(w) G H} 

is a regular language and H = tt(Lh). 

2. The subgroup H < G is said to be L-quasiconvex if there exists K > 
such that for any initial segment u of a word w G Ljj there is a word 
v of length at most K such that tt(uv) G H. 

An important observation of S. Gersten and H. Short [51] states that: 
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Proposition 7.2. Let G be a group with a finite generating set X and let L 

be a language over X U X^ 1 such that vr(L) = G. Let H < G be a subgroup. 
Then H is L-rational if and only if H is L-quasiconvex. 

As the example of cyclic subgroups of G = Z x Z illustrates, it is possible 
that a particular subgroup is rational with respect to one automatic struc- 
ture on G but not the other. However, rationality is invariant in a somewhat 
weaker sense: 

Proposition 7.3. Let G be an automatic group with a finite generating set 
A and an automatic language L over AuA^ 1 . Let H < G be an L-rational 
subgroup. Then for any finite generating set B of G there is an automatic 
language V over B U B -1 for G such that H is V -rational. 

Suppose further that G is word-hyperbolic. Then for any finite generating 
set B of G and for any automatic language L' over B U B^ 1 for G the 
subgroup H is V -rational. 

Proof. The statement regarding hyperbolic groups is well-known and reflects 
the fact that for word-hyperbolic groups all possible notions of quasiconvex- 
ity for subgroups coincide. 



The statement about automatic groups follows from the results of [27], 



although it is not stated there directly. Indeed, Theorem 2.4.1 of [27] proves 



that given G, A, L as in Proposition 7.3, for any finite generating set B of G 



there is an automatic language V for G over B U J3 . The proof actually 
shows that any regular sub-language of L gets "translated" into a regular 
sub-language of L' with the same image in G. In this process Lh gets 
"translated" in L' H and hence L' H is regular, as required. □ 

Because of Proposition it is natural to adopt: 

Definition 7.4 (Rational Subgroup). Let G be an automatic group and let 
H < G be a subgroup. 

We say that H is rational in G if there exists an automatic language L 
for G such that H in L-rational. 

If G is word-hyperbolic then a rational subgroup is also often referred to 
as quasiconvex. 

Proposition 7.5. Let G be an automatic group and let H < G be a rational 
subgroup. Then: 

1. For any finite generating set A of G there is an algorithm which solves 
the membership problem for H in G in quadratic time. 

2. Suppose that G is word-hyperbolic. Then for any finite generating set 
A of G there is an algorithm which solves the membership problem for 
H is G in linear time. 



Proof. Both of these statements are very well-known (see [27, 31, |29|), but 
we will indicate how the algorithm works. 

To see (1) suppose that A is a finite generating set of G. Then there is an 
automatic language L over AU A^ 1 for G such that Lh is regular. Given an 
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arbitrary word w over A U A~ we first apply the quadratic-time algorithm 
of [27 1 to take w to a normal form in L, that is to find w' £ L such that 
w and w 1 represent the same element of G. Since an automatic language L 
consists of quasigeodesics p7| , we have \w'\ < c\w\, where c is some constant 
independent of w. We then check whether or not w' G Lh (which can be 
done in linear time in terms of \w'\). The total expanded time is clearly 
quadratic in \w\. 

For a hyperbolic group G and a rational subgroup H < G the algo- 
rithm solving the membership problem in linear time is virtually identical. 
First, for any finite generating set A of G there is a finite presentation of 
G as G = (A\R) such that all Dehn-reduced words for this presentation 
are quasigeodesics (To see this one has to choose R large enough and use 
the fact that local geodesies in the T(G, A) are global quasigeodesics, pro- 
vided the "local" parameter is chosen to be sufficiently large |32|). It 
is obvious that the set L of all Dehn-reduced words is regular. Moreover, 
H < G is rational implies that H is a quasiconvex subset of T(G, A). Hence 
H is L-quasiconvex since in a hyperbolic metric space a quasigeodesic and a 
geodesic with common endpoints are Hausdorff-close (again, see J2| |32||). 
Therefore H is L-rational by Proposition 7^2 and so Lh is a regular lan- 
guage. Unlike the general case of an automatic group, as we mentioned 
earlier there is a linear time algorithm which takes a word w over A to its 
Dehn-reduced form vJ (see [§, §) where \w'\ < \w\. The algorithm solving 
the membership problem for H in G now works exactly as in the general 
automatic case. □ 



Theorem B. Let G be a finitely generated group and let H < G\ < G where 
G\ has finite index in G. Suppose there is an epimorphism <p '■ G\ — > G such 
that H = <f>(H) is contained in a subgroup K < G of infinite index in G 
such that the membership problem for K in G is in the complexity class C. 
Then the membership problem for H in G has generic- case complexity in 
C. Moreover, if the coset graph of T(G, K) is non-amenable (for some and 
hence any finite generating set A of G), then the generic-case complexity of 
the membership problem for H in G is strongly in C. 

Proof. Let A = {x\, . . . ,Xk} be a finite generating set for G and let B be 
some finite generating set of G\. Let ir : F — > G be the canonical epimor- 
phism corresponding to the presentation G = (x±, . . . ,Xk\u±, . . . , u m , • • • ), 
where F = F( Xl , . . . , x k ). Let K x := ^(K) < G\ < G and let K 2 := 
^{Kx) < F. Note that [G x ■ K x \ = \G : K] = oo and hence [F : K 2 ] = 
co. Moreover, the Schreier coset graphs T(F, A) = T(G, K X ,A) and 
Y(G X ,K\,B) = T(G,K,B) are quasi-isometric since G\ has finite index 
in G. Thus T(F, K 2 , A) is non-amenable if and only if T(G,K,B) is non- 
amenable. 
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Moreover H < K x . Thus if w £ (A U A" 1 )* - {K 2 )a then ir(w) eG-H. 
Let z n = z n (F, K2, A) let C n = ^^-1 1 be the number of words in (A U 
A^ 1 )* of length at most n. 

Since [F : K2] = 00, Theorem implies that {Ki)a has zero asymptotic 
density in (A U A^ 1 )* , that is 

hm — — = and hm — — = 1, 

and in both cases the convergence is exponentially fast if T(G, K, B) is non- 
amenable. Thus the set (A U A^ 1 )* — (K2)a is generic (and even strongly 
generic if T(G, K, B) is non-amenable). 

Fix a finite right Schreier transversal T for Gi in G so that 1 £ T, 
\T\ = [G : G\] and G = Ut^TGit. Also fix the finite Schreier coset graph 
T(G, Gi, A). Recall that a Schreier rewriting process for G\ in G consists in 
rewriting a word hi £ (iU A^ 1 )* as a word where v £ (B U B^ 1 )* and 
i € T, so that ft and u> represent the same element of G. Thus wj represents 
an element of G\ if and only if t = 1. We recall, briefly, how the Schreier 
rewriting process works. For every t S T and x £ AU A^ 1 we fix a word 
u(t, x) £ (B U i? -1 )* and an element s(t, x) £ T such that tx = u(t, x)s(t, x) 
in G. Given a word w = x\ . . . x n £ (AU A^ 1 )* , where each xi £ AU A , we 
rewrite it as follows. First 1 • x\ = u(l, xi)s(l, xi). If x\ . . . Xi has already 
been rewritten as itjtj, where U{ £ (B U i? -1 )* and ti £ T, then 

X\ . . . XiX i+ l = UitiX i+ i = UiU(ti,X i+ i)s(ti, X i+ l). 

Thus we put Ui + i = Uiu(ti,Xi + i), ij+i = s(ti, Xi + \) and continue to the next 
step. At the end of the process we rewrite w as ut, where u is a word in 
BUB" 1 and t £T. This rewriting process requires at most a linear amount 
of space and time in termsof \w\. Moreover, we have |u| < C|u>|, where 
C = max{|n(t,x)| : t £ T,x £ AU A' 1 }. 

We will now construct a correct partial algorithm f2 for the membership 
problem of H in G as follows. Let w be a word in (A U ^4 -1 )*. Denote 
by g the element of G represented by w. First we read the word uu in 
the finite Schreier graph T(G, G\,A) starting from the vertex G\ ■ 1 and 
simultaneously apply the Schreier rewriting process to w. If the terminal 
vertex of the resulting path is different from G\ ■ 1, then ir(w) G\ and 
hence tt(w) H. We declare that w MP(G, H, A) and terminate £1 in 
this case. 

If the resulting path ends at G\ ■ 1 then ir(w) £ G\ and we have rewritten 
w as a word v in (B U B -1 )*. Note that \v\ < C|u>| where C > is some 
constant independent of w. 

By assumption the membership problem for K in G is solvable with com- 
plexity C. We apply this algorithm to the word v. If the element g of G 
represented by v does not belong to K, then g ^ H. Hence the element g 
of G represented by w and v does not belong to H. In this case we declare 
that w MP(G, H, A) and terminate U. 
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If it turns out that v represents an element of K, we terminate £1 without 
an answer. 

The algorithm Q terminates with a correct answer for every w (K^jA- 
Since the set (A U A~ 1 )* — {K^)a is generic (and even strongly generic if 
T(G,K,B) is non-amenable), the statement of Theorem B holds. □ 

Our theorem on the word problem is an immediate corollary. 

Theorem A. Let G = (xi, Xk\R) be a finitely generated non-cyclic group. 
Suppose that G has a finite index subgroup that possesses an infinite quotient 
group G in which the word problem is solvable in the class C. Then the word 
problem for G has generic-case complexity in the class C. 

Moreover, if the group G is non-amenable, then the generic-case complex- 
ity of the word problem for G is strongly in C 

Proof. Let G\ < G be a subgroup of finite index and let : G\ — > G be 
an epimorphism as in the statement of Theorem A. Put H = {1} < G 
and K = {1} < G. Thus (ft(H) < K. Moreover, the membership problem 
for H in G is precisely the word problem for G. Similarly the membership 
problem for K in G is precisely the word problem for G. Now the conclusion 
of Theorem A follows from Theorem B. □ 



Remark 7.6. Theorem ^3 shows that the statements of both Theorem A 
and Theorem B remain true if we define asymptotic density and genericity 
in terms of subsets of F(A) (rather than subsets of (Au A^ 1 )*) by counting 
the ratios of the number of freely reduced words from a subset over the 
number of all freely reduced words. 



Corollary 7.7. Let G be a finitely generated group and let H < G\ < G , 

where [G : G\] < oo. Let <fi : G\ — > G be an epimorphism with H = cp(H). 
Then: 

1. Suppose G is word-hyperbolic and H < G is contained in a quasiconvex 
subgroup K of infinite index in G. Then the membership problem for 
H in G is strongly generically in linear time. 

2. Suppose G is automatic and H < G is contained in a rational subgroup 
K of infinite index in G. Then the membership problem for H in G is 
generically in quadratic time. Moreover, if T(G, K) is non-amenable 
then the membership problem for H in G is strongly generically in 
quadratic time. 



Proof. This follows directly from Theorem B and Proposition 7.5. □ 



8. The Conjugacy problem 

Let F = F(xi, . . . , Xk) and let A = {x\, . . . , x^} be a fixed free basis of 
F, where k>2. 
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Convention 8.1. As before, we will denote by C n the number of words of 
length at most n in (A U A' 1 )* . Thus C n = - ■ 

Let Q n be the number of pairs iw\,w^] of words in (A U A" 1 )* with 
|^i| + | < n. 

Note that if \wi\ + | W2 1 = i < n then jwiu^l = i < n. For a fixed word 
^ of length i there are (i + 1) ways of representing w as, w = w\W2- Recall 
that A U A^ 1 consists of 2k letters. 

Hence: 



n 
i=0 



Proposition 8.2. Let H < F be a subgroup of infinite index. Let S C 
(^4 U A^ 1 )* x (^4 U A^ 1 )* be the set of all pairs (u>i, u>2) with \wi \ + \w2\ < n 
such that wiw^ 1 represents an element of H . Then {>a(S) = 0. 

Proof. Let bj = bj(F, H, A) be the number of all words of length j repre- 



senting elements of H. Then by Theorem ^3 linin^oo bj/(2k) J = since H 
has infinite index in F. 

Suppose {W1.W2) is a pair of words such that |u>i| + \u)2\ = i < n and that 
the words w := wiw^ 1 represents an element of H. For a fixed word w of 
length i representing an element of H there are i + 1 ways of writing w as 
w = W1W2 1 . Hence 

n 

^(5) = + l)bi. 



i=0 



Therefore 



l im ^1 = u m y=o(» + l)&i = (by gtoltz , Theorem) 

n->oc g n 2^ i=0 (i + l)(2k) 1 

= lim — — 7- " = lim . " = 0, 
n^oo ( n + l)(2A;) n (2&) n 

as required. □ 



Theorem C. Let G be a non-cyclic finitely generated group with infinite 
abelianization. Then the generic-case complexity of the conjugacy problem 
for G is linear time. 

Proof. Let G be the abelianization of G and let (ft '■ G — * G be the abelian- 
ization map. Let F = F(xi, . . . ,Xk), A = {x\, . . . ,Xk} and let ir : F — > G be 
the presentation epimorphism. Let H < G be H := Ker(<ft on). As before, 
let Ha be the set of all words in (A U A" 1 )* representing elements of H. 
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Let 



S := {{wx, w 2 ) £(AU A' 1 )* x(AU A )* \ ^{ Wl )) = ^(ir(w 2 ))} = 

= {{wi,w 2 ) G (AUA- 1 )* x (^U^ 1 )*!^!^ 1 eH A }. 

By Proposition ^ Pa(S) = 0. If (wi,W2) S then ^(vr^i)) / 4>{tt(w2)) 
and hence </>(7r(u;i)) is not conjugate to (J)(-k(w2)) in G (since G is abelian). 
Thus if {w\,W2) S then 7r(wi) is not conjugate to ■k{w2) in G. 

Since G is finitely generated abelian, there is an algorithm VL which solves 
the word problem for G in linear time. Hence for any pair (wi,W2) S 
with \wi\ + 1 102 1 < the algorithm f2 will terminate in linear time of n and 
declare that 4>(-k(wi)) 7^ 0(7r(w2)) and hence n(wi) is not conjugate to tt(w2) 
in G. □ 

9. Some general observations on generic-case complexity 

As mentioned in Sectionintro, we are greatly indebted to Carl Jockusch 
and Frank Stephan for stimulating conversations about some general fea- 
tures of generic-case complexity and the results in this section are due to 
them. First, Carl Jockusch observed that if we put a reasonable measure on 
the set of all languages over an alphabet A with at least two letters, then the 
set of generically computable languages has measure zero. Second, Frank 
Stephan observed that the standard Time Hierarchy Theorem of complexity 
theory can be modified to separate deterministic time classes from generic 
complexity classes. Thus, for example, there is a language L in DTIME(n 3 ) 
which is not in GenTIME(n). 

Fix an alphabet A with at least two letters. A language L over A is 
generically computable if there is a partial algorithm Q such that the set S 
on which Q correctly decides membership in L has p(S) = 1. The canonical 
or shortlex ordering of the set A* of all words on A orders words first by 
length and within length, by the lexicographical ordering induced from a 
linear ordering of A. So we have a listing {w\, ...,w n ....} of A* in which all 
words of a shorter length come before all words of a longer length. We can 
now identify a language L C A* with its characteristic function xl where 



Since such a characteristic function is an infinite sequence (6 n )n>i of 0's 
and l's, we can regard it as the binary expansion of a real number in the 
unit interval [0, 1]. A binary expansion is unique except for those which are 
either all 0's or all l's from some point onwards. A binary representation 
which is all 0's from some point onwards corresponds to a finite subset of 
A* . There are only countably many finite subsets and excluding them gives 
a one-to-one correspondence between the infinite subsets of A* and the half- 
open interval (0, 1]. The standard Lebesgue measure on (0, 1] then gives a 




1 if w n £ L, 
if w n L. 
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measure on the set of infinite subsets of A* and this is the measure which 
we use. 



Theorem 9.1. Let A be a finite alphabet with at least two letters. Fix a 
linear ordering of A and let m be the measure on the set of infinite languages 
over A induced by the shortlex ordering as described above. 

Then the set of languages over A which are generically computable has 
measure zero. 

Proof. It suffices to show that if £1 is any fixed partial algorithm whose 
output is either or 1 then the set of languages which are generically decided 
by O has measure 0. Since there are only countably many algorithms, it then 
follows that the set of all generically decidable languages has measure 0. Let 
u be the infinite sequence of 0's and l's where to(n) = 1 if Q calculates 1 
for w n G A* and u(n) = otherwise. The point is that u is now a fixed 
sequence. 

For an integer K > 1 denote by g(K) the the number of subsets of a set 
with K elements which contain at least 2>K/A elements of that set. We need 
only the fact that the ratio of g{K) over the number 2 K of all subsets of a 
set with K elements goes to as K — > oo. This follows easily from applying 
Stirling's formula and computing the asymptotics of the binomial coefficient 
(3^74)' This computation shows that 



(3X/4) 



= o(a K ) as K — > 00 



2 K 

for some number < a < 1. Hence 

g(K) __ (3*74) + (3*74+1) ~\ ^ (k) < K (3*74) _ 



2 K 2 K ~ 4 2 K K- 



For every integer j > the set A* has exactly s(j) := k \_ l 1 words of 
length < j, where k = #A. Thus the first s(j) digits in the binary sequence 
of a language L determine exactly which words in A* of length at most j 
belong to L. 

Fix an arbitrary e > 0. Take an integer j\ > large enough so that 
^pp- < I for any integer K > s(ji). Let Q\ be the set of all infinite binary 
sequences which agree with the first s(j'i) digits of cj in at least 3s(ji)/4 
positions. Note that for a fixed binary string a of length s(j'i) the measure 
of the set of all infinite binary sequences with initial segment a is 2~ s ^ 1 \ 
Hence m{Q 1 ) < 5(s(ii))2- s (^) < f . 

Now take an integer j'2 > ji large enough so that < ^ for any 

integer K > 5(^2)- Let Q2 be the set of all infinite binary sequences which 
agree with the first 5(^2) digits of uj in at least 3s(j2)/4 positions. Again we 
see that m{Q2) < Continue in this way, choosing at step n an integer 

jn > jn-i large enough so that for any integer K > s(j n ) we have < 



GENERIC-CASE COMPLEXITY 



29 



Let Q n be the set of all infinite binary sequences agreeing with the first s(j n ) 
digits of uj in at least 3s(j n )/4 positions. Then m(Q n ) < 
Put Q = U^ =1 Q„. Then 

oo 

m(Q)<]T^ = e . 

n=l 

Now suppose that L is any language generically decided by Q. Be our 
choice of the enumeration of A* and by the definition of generic computabil- 
ity, there exists an integer constant i > such that for any j > i the binary 
sequence of L agrees with the initial segment of uj of length s(j) in at least 
3s(j)/4 positions. Choose n such that j n > i. Then \L S Q n Q Q by 
construction of Q n . 

Thus we have shown that for any e > the set of all languages generi- 
cally computable by Q can be covered by a set of measure at most e. As 
required, this implies that the set of languages generically computable by ft 
has measure zero. □ 

The following theorem is due to Frank Stephan. Recall that we are fol- 
lowing the definitions and notations of |5l]] for computational complexity. 
A proper complexity function f is a non-decreasing function for which there 
is a multi-tape Turing machine which, on an input w computes the string 
i/(M) i n 0(\w\ + f(\w\)) steps and uses 0(f(\w\)) space besides its input. 
The reason for insisting on proper complexity functions is that they can be 
used as "clocks" when simulating Turing machines. One effectively assigns 
a word j(M) on a fixed alphabet A which codes the Turing machine M. 
There is a universal Turing machine U which, when given as input a word 
y(M)w, simulates the machine M on the input w. (We can assume that w 
is a word in the alphabet {0, 1}.) If / is a proper complexity function we 
can define a time-bounded version of the Halting Problem by 



H(f) = {y(M)w : M accepts w in at most f{\w\) steps}. 



The following statement is Lemma 7.1 of [51] which shows that, given the 
code of a Turing machine M, we do not need more than time f 3 (\w\) to 
simulate M for steps on an input w. 



Lemma 9.2. H(f) G DTIME(/ 3 (n)). 

Using the lemma we can prove 

Theorem 9.3. If f(n) > n is a proper complexity function then there is a 
language L C {0, 1}* which is computable in time f 3 (n)) but not generically 
computable in time f(n). 

Proof. The idea of the proof is that for each Turing machine M we specify 
infinitely many lengths devoted to "defeating" the machine M. We can do 
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this by using ordered pairs. Let N + denote the set of positive integers. The 
standard one-to-one enumeration, often called the "pairing function", 



p : N + x N + -> N + 

is given by a simple formula and its inverse function p~ l is also easily com- 
putable, certainly in cubic time. We define the language L as follows. If w 
is a word on {0, 1} , let n = \w\ and calculate p _1 (n) = (r, s). If r is not the 
code 7(M) of a Turing machine then w L 

If r = 7(Af ) for some Turing machine M, we simulate the action of M on 
the input w for f(\w |) steps. By Lemma ^2 this requires at most 0(f 3 (\w\)) 
steps. Put w in L if and only if M did not accept to in f(\w\) steps. 

By construction we have L G DTIME(/ 3 (n)). On the other hand, if L 
were in GenTIME(/(n)) then there would exist a Turing machine M' and 
an integer n such that for all m > n the machine M' correctly decides 
membership in L' on at least three-quarters of all words of length less than 
or equal to m. Let r = j(M'), let s > n and let t = p~ 1 (r,s). Note that 
t > n. By construction, M' does not correctly decide membership in V for 
any words of length t in time f(t). But more than half of the words of length 
less than or equal to t have length exactly t. Hence M' fails to generically 
decide L' in time f(n), yielding a contradiction. □ 
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